24 



WHAT IS CLAIMED IS : / 

1. A method of voice recognition/in a noise-ridden acoustic signal 
comprising: / 

a phase of digitizing and subdividing the noise-ridden acoustic signal into a 
sequence of temporal frames, / 

a phase of parametrization of speech-containing temporal frames so as to 
obtain a vector of parameters, per frame, in the frequency domain, this vector of 
parameters expressing the acoustic contents of the frame, 
a shape-recognition phase in which me vectors of parameters are assessed with 
respect to references pre-recordea in a reference space during a preliminary 
learning phase, so as to obtain recognition by the determining of at least one 
reference which is closest to the vector of parameters, 

a phase of reiterative searchingtfor successive noise models in the sequence of 
temporal frames, a new noise/model replacing a current noise model, a noise 
model comprising several successive frames, 
wherein the method comprises: 

a phase of searching for a noise transition between the new noise model and 
the current model, / 

and wherein, when the /noise transition has been detected, the method 
comprises a phase of updating the reference space as a function of the new 
noise model, the parametrization phase including a step of matching the 
parameters to the new noise model. 

2. A method of voice recognition according to claim 1, wherein the phase 
of searching for a noise transition comprises a step of searching for an energy 
incompatibility and/or/a step of searching for a spectral incompatibility between 
the new noise modey and the current model, the detection of an incompatibility 
expressing a noise transition. 

3. A method of voice recognition according to claim 2, wherein the step 
of searching for an/energy incompatibility comprises the comparison of the ratio 
between the mearii energy E newmoc | of the new noise model and the mean 
energy of the current noise model E moc j curr with a low threshold S' and a high 
threshold S, an energy incompatibility being found if the ratio is outside the 
interval delimited toy the two thresholds S, S\ 
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4. A method of voice recognition according to^cfaim 3, wherein the step 
of searching for an energy incompatibility also comprises a comparison of the 
mean energy E newmoc j of the new noise modej/and the mean energy of the 
current noise model E moc j curr with an energy floor threshold E m j n below which 

the noise is negligible, the energy incompatibly determined by the comparison 
of the ratio between the mean energy of ther new noise model E newmoc j and the 
mean energy of the current noise model E moc j curr being ignored when the 
mean energy of the new noise modeKE newmoc j and the mean energy of the 
current noise model E moc j curr are both below the energy floor threshold E m j n . 

5. A method of voice recodnition according to claim 2, wherein the step 
of searching for spectral incompatibility comprises, on the basis of spectral 
coefficients Bj moc | CLirr Bj pew^od respectively expressing the spectral energy 

of the frames of the current npise model and the spectral energy of the frames of 
the new noise model in at/east one frequency channel i, a comparison of the 
ratio between the spectraKcoefficient Bj newmoc | associated with the frequency 
channel i of the new /oise model and the spectral coefficient Bj jmoc | curr 
associated with the same frequency channel i of the current noise model with a 
low threshold Sf' arnya high threshold Sf, a spectral incompatibility being found if 
the ratio is located c/utside the interval delimited by the two thresholds, Sf, Sf. 

6. A method of voice recognition according to claim 5, wherein the step 
of searching fo/ a spectral incompatibility also comprises, for at least one 
frequency channel i, a comparison of the spectral coefficient Bj jnewmoc j of the 

new noise mpdel in this frequency channel i and of the spectral coefficient 
B i,modcurr °f the current noise model in this frequency channel i with a floor 
spectral coefficient Bj m j n associated with this frequency channel i, namely a 

floor belovy which the noise is negligible, a spectral incompatibility determined by 
the comparison of the ratio between spectral coefficients being ignored when, 
for this frequency channel i, the spectral coefficients of the new model and of the 
current model are both below the floor spectral coefficient Bj m j n . 

i. A method of voice recognition according to claim 1, wherein the 
parametrization phase comprises a step of determining spectral coefficients 
Bj p ar / each associated with a frequency channel i each expressing a 
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representation of the spectral energy of/ a frame containing speech in the 
frequency channel i, 

the parameter-matching step comprising a determining, for each spectral 
coefficient Bj par , of a robustness operator OpRob(Bj par ), this robustness 

operator expressing the confidence to/be attached to the spectral coefficient 
B i,par with respect to the noise level of the new noise model in the same 

frequency channel i, 

a weighting of the spectral coefficient Bj par with the robustness operator 
OpRob(Bj p ar ) and 

a determining of the vector of parameters on the basis of the weighted spectral 
coefficients. 

8. A method according jo claim 7, wherein the robustness operator 
OpRob(Bj p ar ) verifies the following relationship: 



OpRob(Bpar Wmaxf o^^+^^^^ol 

^ y I Dipar-r^r ^Di/iewmod J Jj 



B ifP ar being the spectral coefficient and P(B i( newmod) being a parameter dependent 
on the noise level of the ney noise model having activated the transition in the 
frequency channel i. 

9. A method of v6ice recognition according to claim 1, wherein the 
reference space updating phase comprises the following operations, on the 
basis of the basic spectral coefficients each associated with a frequency channel 
i, each expressing the ^pectral energy of a basic frame obtained during the 
learning phase: 

the determining of a /robustness operator OpRob(Bj ( b ase ) for each basic 
spectral coefficient Bj h ase , this robustness operator expressing the confidence 
to be attached to the spectral coefficient Bjb ase with respect to the noise level 
of the new noise model in the same frequency channel i, 

the weighting of the basic spectral coefficients Bj b ase with the respective 
robustness operators OpRob(Bjb ase ) f and 

the preparation <pf the updated references with the weighted spectral 
coefficients. 
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10. A method according to claim 9, wherein the^robustness operator 
OpRob(Bj b ase ) for the updating of the reference s^a^e verifies the following 

relationship: 



OpRob(EW )=|maxf 0.25 1 ^^T^^ 



B ilb ase being the basic spectral coefficient and P(B j( newmod) being a parameter 
depending on the noise level of the Xew noise model having activated the 
transition, in the frequency channel i. / 

11. A method according to ofaim 9, in which the references are prepared 
on the basis of compressed basic spectral coefficients, wherein the method 
uses a conversion table to convert the compressed basic spectral coefficients 
into compressed and weighted^ basic spectral coefficients. 

12. A method according to claim 11, wherein the conversion table 
contains the non-compressed basic spectral coefficients Bj ^ase obtained by 

application of the reverse' of the compression function to the compressed basic 
coefficients and whereiiYthe method comprises: 

a determining of the robustness operator OpRob(Bj b ase ) for each of the non- 
compressed basic spectral coefficients Bjb ase , 

a weighting of the npn-compressed basic spectral coefficients Bj b ase , 

a compression of me non-compressed and weighted basic spectral coefficients 

so as to obtain the compressed and weighted basic spectral coefficients. 

13. A method according to claim 1, using, as references, a sequence of 
temporal frames corresponding to one or more words, this sequence of frames 
being identified by a series of vectors of parameters, these parameters being 
obtained by Compression of spectral coefficients. 

14. A method according to claim 1, using, as references, a sequence of 
temporal frames corresponding to one or more phonemes, this sequence of 
frames being identified by the center and the standard deviation of a Gaussian 
function/this center and this standard deviation depending on the parameters of 
the vectors of parameters of the frames. 

/l5. A method according to claim 1, comprising a phase of noise- 
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suppression in the speech-cont^fning temporal frames before the 
parametrization phase. 

16. A system of voice recognition in a noise-ridden acoustic signal for the 
implementation of the method acco/ding to one of the claims 1 to 15, wherein 

5 the system comprises: 

means to acquire the acoustic signal, digitize it and subdivide it into temporal 
frames, 

a parametrization chain to translate the temporal frames containing speech into 

vectors of parameters in the frequency domain, 
10 shape-recognition means with la reference space acquired during a learning 

stage, to compare the vectors/of parameters coming from the parametrization 

chain with the references, so as to obtain recognition by the determination of a 

reference that most closely approaches the vectors of parameters, 

means for modelling the noise to reiteratively prepare noise models, a new noise 
15 model replacing a current no/se model, 

means for detecting a noisp transition between the new noise model and the 

current noise model, 

means to match the parametrization chain with the noise of the new noise model 

having activated the noise transition, 
20 means to update the references of the reference space as a function of the 

noise level of the new noise model having activated the noise transition. 

17. A system of /voice recognition according to claim 16, wherein the 
means used to update the references of the reference space comprise a first 
memory space to store the updated references, these updated references 

25 having to replace current references used for shape recognition before the 
detection of the noise/transition, these current references being stored in a 
second memory space/ 

18. A voice-recognition system according to claim 16, comprising a 
memory space to store compressed basic spectral coefficients obtained from 

30 basic spectral coefficients each associated with a frequency channel i, these 
basic spectral coefficients each expressing the spectral energy of a basic frame 
coming from the learning stage, a conversion table to convert the compressed 
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basic spectral coefficients intO/dbmpressed and weighted basic spectral 
coefficients, each weighted by'a robustness operator OpRob(Bj b ase ) as a 

function of the noise \ewe)/oi the new noise model having activated the noise 
transition and of the ba^ic spectral coefficient (Bjb ase ) to be weighted, these 

compressed and weighted basic spectral coefficients being used for the 
updating of the references of the reference space. 

19. A voice-recognition system according to claim 16, comprising means 
for noise-sdppression in the speech-containing temporal frames before their 
parametrlzation. 



